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Abstract. Consider Glauber dynamics for the Ising model on a graph 
of n vertices. Hayes and Sinclair showed that the mixing time for this 
dynamics is at least nlogn//(A), where A is the maximum degree and 
/(A) — 0(Alog'^ A). Their result applies to more general spin systems, 
and in that generality, they showed that some dependence on A is nec- 
essary. In this paper, we focus on the ferromagnetic Ising model and 
prove that the mixing time of Glauber dynamics on any n-vertex graph 
is at least (1/4 + o(l))7ilogn. 



1. Introduction 

Consider a finite graph G = {V, E) and a finite alphabet Q. A general 
spin system on G is a probability measure ^ on Q^; well studied examples 
in computer science and statistical physics include the uniform measure 
on proper colorings and the Ising model. Glauber (heat-bath) dynamics 
are often used to sample from /u (see, e.g., [9, 10, 14]). In discrete-time 
Glauber dynamics, at each step a vertex v is chosen uniformly at random 
and the label at v is replaced by a new label chosen from the /i-conditional 
distribution given the labels on the other vertices. This Markov chain has 
stationary distribution /i, and the key quantity to analyze is the mixing time 
^mixi at which the distribution of the chain is close in total variation to /x 
(precise definitions are given below). 

If \V\ = n, it takes (1 + o(l))nlogn steps to update all vertices (coupon 
collecting) , and it is natural to guess that this is a lower bound for the mixing 
time. However, for the Ising model at infinite temperature or equivalently, 
for the 2-colorings of the graph (V, 0), the mixing time of Glauber dynamics 
is asymptotic to nlogn/2, since these models reduce to the lazy random 
walk on the hypercube, first analyzed in [1]. Thus mixing can occur before 
all sites are updated, so the coupon collecting argument does not suffice to 
obtain a lower bound for the mixing time. The first general bound of the 
right order was obtained by Hayes and Sinclair [6], who showed that the 
mixing time for Glauber dynamics is at least nlogn//(A), where A is the 
maximum degree and /(A) = 0(Alog^A). Their result applies for quite 

general spin systems, and they gave examples of spin systems n where some 
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dependence on A is necessary. After the work of [6], it remained unclear 
whether a uniform lower bound of order nlogn, that does not depend on A, 
holds for the most extensively studied spin systems, such as proper colorings 
and the Ising model. 

In this paper, we focus on the ferromagnetic Ising model, and obtain a 
lower bound of (l/4 + o(l))nlog n on any graph with general (non-negative) 
interaction strengths. 

Definitions. The Ising Model on a finite graph G = (V, E) with interaction 
strengths J = {Juv > ■ uv € E} is a probability measure fiQ on the 
configuration space Q = {±1}^, defined as follows. For each cj G $7, 



where Z{J) is a normalizing constant called the partition function. The 
measure fiQ is also called the Gibbs measure corresponding to the interac- 
tion matrix J. When there is no ambiguity regarding the base graph, we 
sometimes write /i for fiQ. 

Recall the definition of the Glauber dynamics: At each step, a vertex 
is chosen uniformly at random, and its spin is updated according to the 
conditional Gibbs measure given the spins of all the other vertices. It is 
easy to verify that this chain is reversible with respect to ^g- 

Next we define the mixing time. Let {Xt) denote an aperiodic irreducible 
Markov chain on a finite state space O with transition kernel P and station- 
ary measure vr. For any two distributions /i, on i}, their total-variation 
distance is defined to be 



be the mixing time with initial state x. (The choice of 1/4 here is by con- 
vention, and can be replaced by any constant in (0, 1/2), without affecting 
the (1/4 + o(l))nlogn lower bound in the next theorem.) The mixing time 
tmix is then defined to be maxx^n ^mix- 
We now state our main result. 

Theorem 1. Gonsider the Ising model (1.1) on the graph G with interaction 
matrix J, and let imix(^' denote the mixing time of the corresponding 




(1.1) 




For X € let ¥x denotes the probability given Xq = x and let 
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Glauber dynamics, started from the all-plus configuration. Then 
inft+JG,J)>(l/4 + o(l))nlogn, 

where the infimum is over all n-vertex graphs G and all nonnegative inter- 
action matrices J. 

Remark. Theorem 1 is sharp up to a factor of 2. We conjecture that 
(1/4 + o(l)) in the theorem could be replaced by (1/2 + o(l)), i.e., the 
mixing time is minimized (at least asymptotically) by taking J = 0. 

Hayes and Sinclair [6] constructed spin systems where the mixing time of 
the Glauber dynamics has an upper bound 0(n log n/ log A). This, in turn, 
implies that in order to establish a lower bound of order n log n for the Ising 
model on a general graph, we have to employ some specific properties of the 
model. In our proof of Theorem 1, given in the next section, we use the 
GHS inequality [5] (see also [7] and [3]) and a recent censoring inequality 
[13] due to Peter Winkler and the second author. 

2. Proof of Theorem 1 

The intuition for the proof is the following: In the case of strong interac- 
tions, the spins are highly correlated and the mixing should be quite slow; 
In the case of weak interaction strengths, the spins should be weakly depen- 
dent and close to the case of the graph with no edges, therefore one may 
extend the arguments for the lazy walk on the hypercube. 

We separate the two cases by considering the spectral gap. Recall that the 
spectral gap of a reversible discrete-time Markov chain, denoted by gap, is 
1 — A, where A is the second largest eigenvalue of the transition kernel. The 
following simple lemma gives a lower bound on t^j^ in terms of the spectral 
gap- 
Lemma 2.1. The Glauber dynamics for the ferromagnetic Ising model (1.1) 
satisfies i+j^ > log2 • (gap-^ - 1). 

Proof. It is well known that tmix > log 2 • (gap~^ — 1) (see, e.g.. Theo- 
rem 12.4 in [9]). Actually, it is shown in the proof of [9, Theorem 12.4] that 
Cix > log2 • (gap^^ - 1) for any state x satisfying f{x) = ||/||oo, where / 
is an eigenfunction corresponding to the second largest eigenvalue. Since 
the second eigenvalue of the Glauber dynamics for the ferromagnetic Ising 
model has an increasing eigenfunction / (see [11, Lemma 3]), we infer that 
either ||/||oo = /(+) or ||/||oo = /(— )• By symmetry of the all-plus and the 
all-minus configurations in the Ising model (1.1), we have t^j^ = t";^, and 
this concludes the proof. ■ 
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Lemma 2.1 implies that Theorem 1 holds if gap^^ > nlogn. It remains 
to consider the case gap"^ < nlogn. 

Lemma 2.2. Suppose that the Glauber dynamics for the Ising model on a 
graph G = iV, E) with n vertices satisfies gap"-"^ < ?ilogn. Then there exists 
a subset F C V of size \_\/n/ log nj such that 

^ 2 

CoY^{a{u),a{v)) <j—. 

Proof. We first establish an upper bound on the variance of the sum of 
spins S = S{a) = '^.i;^y cr{v). The variational principle for the spectral 
gap of a reversible Markov chain with stationary measure vr gives (see, e.g., 
[2, Chapter 3] or [9, Lemma 13.12]: 

/ Var^(/) 

where £{f) is the Dirichlet form defined by 

£{f) = {iI-P)fJ)^ = l [fix)-fiy)fn{x)P{x,y). 

Applying the variational principle with the test function S, we deduce 
that 

Since the Glauber dynamics updates a single spin at each step, £{S) < 2, 
whence 

Var^(5) < <S(5)gap^^ < 2 nlogn. (2.1) 
The covariance of the spins for the ferromagnetic Ising model is non-negative 
by the FKG inequality (see, e.g., [4]). Applying Claim 2.3 below with k = 
to the covariance matrix of a concludes the proof of the lemma. ■ 

Claim 2.3. Let A be an n x n matrix with non-negative entries. Then for 
any k < n there exists F C {1, . . . , n} such that \F\ = k and 

Proof. Let i? be a uniform random subset of {1, . . . , n} with \R\ = k. Then, 
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_ k{k-l) ^ . ^ \^ A 

-n(n-l) ^ ^m1{.^,}<;^2^A,. 
Existence of the desired subset F follows immediately. 
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We now consider a version of accelerated dynamics (Xt) with respect to 
the subset F as in Lemma 2.2. The accelerated dynamics selects a vertex 
V £ V uniformly at random at each time and updates in the following way: 

• If -y F, we update cr{v) as in the usual Glauber dynamics. 

• li V £ F, we update the spins on {v} U F'^ all together as a block, 
according to the conditional Gibbs measure given the spins on F \ 
{v}. 

The next censoring inequality of [13] guarantees that, starting from the all- 
plus configuration, the accelerated dynamics indeed mixes faster than the 
original one. In what follows, write ^ if stochastically dominates fi. 

Theorem 2.4 ([13] and also see [12, Theorem 16.5]). Let {Q,S,V,tt) be 
a monotone system and let ^ he the distribution on which results from 
successive updates at sites vi,...,Vm, beginning at the top configuration. 
Define v similarly but with updates only at a subsequence Vi-,^, . . . ,Vif,. Then 
/i ^ I/, and ll/i — ttIItv ^ ||'^~'^||tv- Moreover, this also holds if the sequence 
vi, . . . , Vm and the subsequence ii, . . . , i/j are chosen at random according to 
any prescribed distribution. 

In order to see how the above theorem indeed implies that the accelerated 
dynamics {Xt) mixes at least as fast as the usual dynamics, first note that 
any vertex u ^ F is updated according to the original rule of the Glauber 
dynamics. Second, for u £ F, instead of updating the block {u} U F^, 
we can simulate this procedure by performing sufficiently many single-site 
updates in {u}[JF''. This approximates the accelerated dynamics arbitrarily 
well, and contains a superset of the single-site updates of the usual Glauber 
dynamics. Theorem 2.4 thus completes this argument. 

Let (Yt) be the projection of the chain (Xt) onto the subgraph F. Re- 
calling the definition of the accelerated dynamics, we see that (Yt) is also 
a Markov chain, and the stationary measure for (Yt) is the projection 
of hg to F. Furthermore, consider the subsequence (Zt) of the chain (It) 
obtained by skipping those times when updates occurred outside of F in 
(Xt). Namely, let Zt = Yxt where Kt is the t-th time that a block {v} U F"^ 
is updated in the chain (Xt). Clearly, {Zt) is a Markov chain on the space 
{—1,1}^, where at each time a uniform vertex v from F is selected and 
updated according to the conditional Gibbs measure fic given the spins on 
F \ {v}. The stationary measure for (Zt) is also vp. 

Let St = X^tjgi;' -^i(^^) be the sum of spins over F in the chain (Zt). It 
turns out that St is a distinguishing statistic and its analysis yields a lower 
bound on the mixing time for chain (Zt). To this end, we need to estimate 
the first two moments of St. 
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Lemma 2.5. Let he an instance of the chain {Zt) started at the 

all-plus configuration. Then its sum of spins satisfies that 

E+(50>|F|(l-r^'* 



Proof. The proof follows essentially from a coupon collecting argument. Let 
(Zj*) be another instance of the chain {Z^), started from up. It is obvious that 
we can construct a monotone coupling between {z['^'^) and {Z^) (namely, 
Zj"^"* > Z^ for all t G N) such that the vertices selected for updating in 
both chains are always the same. Denote by U[t\ this (random) sequence of 
vertices updated up to time t. Note that Z^ has law vp, even if conditioned 



on the sequence U\t\. Recalling that z'l^^ > Z\ and E^o"(t') = 0, we obtain 
that 



E+[Zt^+^(z;) I V G > 0. 
It is clear that Z^^\v) = \\iv ^ [/[t]. Therefore, 

E+[z(+)(^;)]>P(z;0t/[t]) = (l- 
Summing over v £ F concludes the proof. 



1 



We next establish a contraction result for the chain {Zt). We need the 
GHS inequality of [5] (see also [7] and [3] ) . To state this inequality, we recall 
the definition of the Ising model with an external field. Given a finite graph 
G = {V, E) with interaction strengths J = {Juv > : uv £ E} and external 
magnetic field H = : v E V}, the probability for a configuration a E 

= {±1}^ is given by 

(^) = -^TJJ^ exp ( ^ JuvC7{u)a{v) + H{v)a{v)) , (2.2) 

where Z{J,H) is a normalizing constant. Note that this specializes to (1.1) 
if H = 0. When there is no ambiguity for the base graph, we sometimes 
drop the subscript G. We can now state the 

GHS inequality [5]. For a graph G = {V,E), let //^ = fiQ as above, 
and denote by my{H) = '&^H[a{v)] the local magnetization at vertex v. If 
Hv ^ for all v & V , then for any three vertices u,v^w (not necessarily 
distinct), 

d'^mJH) 
dHudHy^ 

The following is a consequence of the GHS inequality. 

Corollary 2.6. For the Ising measure fi with no external field, we have 

k 

^t,[<y{u) \ vi = l for alll <i <k\< ^E^[o-(u) \vi = l]. 

i=l 
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Proof. The function f{H) = mu{H) satisfies /(O) = 0. By the GHS inequal- 
ity and Claim 2.7 below, we obtain that for all H, H' € : 

mu{H + H')<muiH)+muiH'). (2.3) 

For 1 < i < k and h > 0, let H^^ be the external field taking value h on 
Vi and vanishing on 1/ \ {vi}. Applying the inequality (2.3) inductively, we 
deduce that 



Finally, let /i — > c3o and observe that mu{H^) E^[o"(ti) | a{vi) = 1] and 
muiEi Hi) ^ ^p,[(y{u) I a{vi) = 1 for alll < i < A;]. ■ 



that qJq^. < for all x G M" and I < i, j < n. Then for all x,y £ 



Claim 2.7. Write M+ = [0, oo) and let f : W1 R be a C"^ -function such 
I X G M" and 1 < i, j < n. Then 

f{x + y)-f{x)<f{y)-f{0). 

Proof. Since all the second derivatives are non-positive, ^^-^ is decreasing in 

every coordinate with x for all x G M" and i < n. Hence, is decreasing 
inWl. Let 

i * 

It follows that gx{t) < go{t) for all x,y £ M". Integrating over t G [0,1] 
yields the claim. ■ 

Lemma 2.8. Suppose thatn > e^. Let (Zt) be another instance of the chain 
[Zij. Then for all starting states zq and zq, there exists a coupling such that 

[Y,\Zt{v)-Zt{v)\\ < (l--^)*^|zo(„)-io(^)|. 



Proof. Fix rjjf] G { — 1,1}^ such that rj and fj differ only at the vertex v and 
r]{v) = 1. We consider two chains (Zt) and (Zt) under monotone coupling, 
started from r] and f] respectively. Let rj^ be the restriction of r/ to ^ for 
A C F (namely, rjA G { — 1, 1}^ and tia{v) = rj{v) for all v £ A), and write 

ip{u,V,v) = ^ti[cr{u) I o-F\M = Vf\{u}] - ^ti[(^{u) I (^F\{u} = VF\{u}] ■ 
By the monotone property and symmetry of the Ising model, 

ilj{u,ri,fi) < E^[a{u) \ o-^\{„} = +] - E^[(t(u) | <t^\{„} = -] 
= 2E^[c7(u) I crF\{u} = +] ■ 
By Corollary 2.6, 

i;{u,v,fj)<2 IE;.k(n) I a(w;) = 1] = 2 ^ Cov(a(u), a(y;)) . 

w£F\{u} w£F\{u} 
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Recalling the non-negative correlations between the spins, we deduce that 
under the monotone coupling 

v&F ' ' ' ' u<^F\{v} 

uGF\{v} weF\{u} 

By Lemma 2.2, we get that for n > e'^. 



E - 



[\Y.\Z,{v) - Z,{v)\ 



1 2 1 

< 1 - TT^ + TT^n < 1 



|F| |F| logn ~ 2\F\ 



veF 

Using the triangle inequality and recursion, we conclude the proof. ■ 

From the contraction result, we can derive the uniform variance bound on 
St- This type of argument appeared in [8] (see Lemma 2.4) when (Zt) is a one 
dimensional chain. The argument naturally extends to multi-dimensional 
case and we include the proof for completeness. 

Lemma 2.9. Let (Zt) and (Zt) be two instances of a Markov chain taking 
values in M". Assume that for some p < 1 and all initial states zq and zq, 
there exists a coupling satisfying 

where we used the convention that z{i) stands for the i-th coordinate of z for 
z E M". Furthermore, suppose that \ Zt{i) — Zi_i(i)\ < R for all t. Then 
for any t G N and starting state z E M", 

2 ^2 



Var, < Y— 2^^ 



P 

Proof. Let Zt and Z[ be two independent instances of the chain both started 
from z. Defining Qt = Yli ■^t(^) ^'^^ Q't — Yli ■^t(O) obtain that 

\&z[Qt I ^1 = ^i] - MQ't\z't = z[]\ = - ^-4QU]\ 

for all possible choices of zi and z'^. It follows that for any starting state z 

Var,(E,[Qt | Zi]) = \¥.,[{¥.zAQt^i] -^z[\Q't-i]f] < 2{p'^^Rf. 
Therefore, by the total variance formula, we obtain that for all z 

YavM) = Var,(E,[Qt | Zi]) + E,[Var,(Qj | Z^)] < 2{p'~^ Rf + ut-i , 
where ut = max^ Var^(Qt). Thus ut < 2{p*~^Rf + ut^i, whence 

2R^ 



<Y.^n-...,)<Y^2p-^^-^^R-<- 
1=1 i=l ^ 
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completing the proof. ■ 

Combining the above two lemmas gives the following variance bound (note 
that in our case R = 2 and p = 1 — , so 1 — > ) • 

Lemma 2.10. For all t and starting position z, we have \ar z{St) < 16|-F|. 

We can now derive a lower bound on the mixing time for the chain (Zt). 

Lemma 2.11. The chain (Zt) has a mixing timet+.^> i|F|log|F|-20|F|. 

Proof. Let (Z^^^) be an instance of the dynamics (Zt) started from the 
all-plus configuration and let Z* G {—1, 1}^ be distributed as up. Write 

To = i|F|log|F| -20|F| . 



It suffices to prove that 

■'To ' ^ - 4 ' 

:(+) _ V- 



dTv(4t^'5*)>i, (2.4) 



where Sj,^ = YlveF ^To ^^^ ^ before and S* = ^y^p Z* (v) be the sum 
of spins in stationary distribution. To this end, notice that by Lemmas 2.5 
and 2.10: 

K+(4t^) > e20+°(i)v^ and Var+(4t^) < 16|F| . 
An application of Chebyshev's inequality gives that for large enough n 

P+(4+) < e^OvT^) < , ^ < \ . (2.5) 

^0 ~ (g20+o(l) _glO) /|;p|)2 - 4 ^ ^ 

On the other hand, it is clear by symmetry that E^^5* = 0. Moreover, 
since Lemma 2.10 holds for all t, taking t ^ oo gives that Varj^^ S* < 16|-F|. 
Applying Chebyshev's inequality again, we deduce that 

16|F| ^ 1 
(ei0v^)2 " 4 ■ 

Combining the above inequality with (2.5) and the fact that 

dTv(4t^'5*) > 1 - IP+(4t^ < - > e'^yWl) , 

we conclude that (2.5) indeed holds (with room to spare), as required. ■ 

We are now ready to derive Theorem 1. Observe that the dynamics (Yt) 
is a lazy version of the dynamics (Zt). Consider an instance (Y^^) of the 
dynamics (1^) started from the all-plus configuration and let Y* G {—1,1}-^ 
be distributed according to the stationary distribution up. Let S^~^^ and S* 
again be the sum of spins over F, but with respect to the chain (Y^ ^^) and 
the variable Y* respectively. Write 
„ n / 1 



^^,F\log\F\ - AO\F\ 



10 



JIAN DING AND YUVAL PERES 



and let Nt be the number of steps in [1, T] where a block of the form {v}L)F 
is selected to update in the chain (Y"/^-*). By Chebyshev's inequality, 

P(iV^ > i|F| log |F| - 20|F|) < = oil) . 

Repeating the arguments in the proof of Lemma 2.11, we deduce that for 
all t<To = log |F| - 20|F|, we have 

Therefore 

||P+(y^+) E •) - Mtv > 1 - n^T > To) - F^^iS* > ei°vT^) 

- P+(5^+^ < e^^y^l I Nt < To) . 

Altogether, we have that 

||p+(y^+) G O-^fIItv >h + o{i) >l, 

and hence that 

^ + ,Y \ \ l+o(l) , 

where t^-^ refers to the mixing time for chain (Y^^^). Since the chain (Yt) 
is a projection of the chain (Xt), it follows that the mixing time for the 
chain (Xt) satisfies t^^-^ > (1/4 + o(l))nlogn. Combining this bound with 
Theorem 2.4 (see the discussion following the statement of the theorem), we 
conclude that the Glauber dynamics started with the all-plus configuration 
has mixing time t^^^ > (1/4 + o(l))nlogn. ■ 

Remark. The analysis naturally extends to the continuous-time Glauber 
dynamics, where each site is associated with an independent Poisson clock 
of unit rate determining the update times of this site as above (note that the 
continuous dynamics is \V\ times faster than the discrete dynamics). We 
can use similar arguments to these used above to handel the laziness in the 
transition from the chain (Zf) to the chain (Yt). Namely, we could condition 
on the number of updates up to time t and then repeat the above arguments 
to establish that t^j^ > (1/4-1- o(l)) logn in the continuous-time case. 

Remark. We believe that Theorem 1 should have analogues (with t^[^ in 
place of i^ix) for the Ising model with arbitrary magnetic field, as well as 
for the Potts model and proper colorings. The first of these may be accessible 
to the methods of this paper, but the other two models need new ideas. 
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